Discovering structural correlations in alpha-helices.
نویسندگان
چکیده
We have developed a new representation for structural and functional motifs in protein sequences based on correlations between pairs of amino acids and applied it to alpha-helical and beta-sheet sequences. Existing probabilistic methods for representing and analyzing protein sequences have traditionally assumed conditional independence of evidence. In other words, amino acids are assumed to have no effect on each other. However, analyses of protein structures have repeatedly demonstrated the importance of interactions between amino acids in conferring both structure and function. Using Bayesian networks, we are able to model the relationships between amino acids at distinct positions in a protein sequence in addition to the amino acid distributions at each position. We have also developed an automated program for discovering sequence correlations using standard statistical tests and validation techniques. In this paper, we test this program on sequences from secondary structure motifs, namely alpha-helices and beta-sheets. In each case, the correlations our program discovers correspond well with known physical and chemical interactions between amino acids in structures. Furthermore, we show that, using different chemical alphabets for the amino acids, we discover structural relationships based on the same chemical principle used in constructing the alphabet. This new representation of 3-dimensional features in protein motifs, such as those arising from structural or functional constraints on the sequence, can be used to improve sequence analysis tools including pattern analysis and database search.
منابع مشابه
Discovering Side-Chain Correlation in α-Helices
Using a new representation for interactions in protein sequences based on correlations between pairs of amino acids, we have examined α-helical segments from known protein structures for important interactions. Traditional techniques for representing protein sequences usually make an explicit assumption of conditional independence of residues in the sequences. Protein structure analyses, howeve...
متن کاملDiscovering Side-Chain Correlation in m-Helices
Using a new representation for interactions in protein sequences based on correlations between pairs of amino acids, we have examined or-helical segments from known protein structures for important interactions. Traditional techniques tbr representing protein sequences usually make an explicit assumption of conditional independence of residues in the sequences. Protein structure analyses, howev...
متن کاملبررسی تمایل مجاورت اسیدهای آمینه با یکدیگر در مارپیچهای آلفا
In order to study the tendency of amino acid neighbors in helical structures, proteins with known structures were carefully analyzed. The studied helical positions: N , Ncap, N1, N2, N3, N4, M, C4, C3, C2, C1, Ccap, C and their doublet counterparts: N Ncap, NcapN1, N1N2, N2N3, N3N4, M1M2, M2M3, C4C3, C3C2, C2C1, C1Ccap, CcapC were carefully analyzed. The propensity for all amino acids i...
متن کاملFlexibility of alpha-helices: results of a statistical analysis of database protein structures.
Alpha-helices stand out as common and relatively invariant secondary structural elements of proteins. However, alpha-helices are not rigid bodies and their deformations can be significant in protein function (e.g. coiled coils). To quantify the flexibility of alpha-helices we have performed a structural principal-component analysis of helices of different lengths from a representative set of pr...
متن کاملPredicting coiled coils by use of pairwise residue correlations.
A method is presented that predicts coiled-coil domains in protein sequences by using pairwise residue correlations obtained from a (two-stranded) coiled-coil database of 58,217 amino acid residues. A program called PAIRCOIL implements this method and is significantly better than existing methods at distinguishing coiled coils from alpha-helices that are not coiled coils. The database of pairwi...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید
ثبت ناماگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید
ورودعنوان ژورنال:
- Protein science : a publication of the Protein Society
دوره 3 10 شماره
صفحات -
تاریخ انتشار 1994